The invention relates to coding of an audio signal, in which sinusoids relevant
for reproducing the audio signal are selected and their parameters are encoded.






In a sinusoidal audio encoder, at least part of an audio signal is represented by
a plurality of sinusoids, which are usually described by their frequencies, their
amplitudes and optionally their phases. In the encoding process, an audio signal is segmented
into time segments, which are analyzed for their frequency content. Typically, the
segment size used in an audio encoder is in the range of 5 to 60 ms. For each
segment a number of sinusoids are selected, of which the parameters are subsequently coded.
In order to minimize the bit rate for a given audio quality, only relevant sinusoids need to be
selected and encoded, i.e. only those sinusoids needed to reproduce the encoded audio signal
at an acceptable perceptual quality.

R. McAulay and T. Quatieri, "Speech analysis/synthesis based on a sinusoidal
representation," IEEE Transactions on Acoustics, Speech and Signal Processing, 1986,
34:744-754, disclose a method to select sinusoids called peak-picking. The peak-picking
method comprises a selection of those frequencies that have a peak in the amplitude
spectrum. Another method for selecting sinusoids is an iterative process called matching
pursuit, as disclosed in the article by R. Heusdens and S. van de Par, "Rate-distortion
optimal sinusoidal modeling of audio and speech using psychoacoustical matching pursuits,"
Proc. IEEE Int. Conf. Acoust., Speech and Signal Proc., Orlando (USA), 2002. In every
iteration, the frequency having the highest peak in the amplitude spectrum is selected and the
corresponding sinusoid is subsequently subtracted from the signal. The residual signal is used
in the next iteration. The process is typically stopped when a fixed number of sinusoids have
been selected.
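
The peak-picking step described above can be sketched as follows. This is a minimal illustration, assuming a Hann window and a plain FFT; the function name `pick_peaks`, the sampling rate and the test signal are hypothetical and are not taken from the cited article.

```python
import numpy as np

def pick_peaks(segment, fs, n_fft=None):
    """Return (frequencies_hz, amplitudes) of all local maxima of the amplitude spectrum."""
    n = len(segment)
    n_fft = n_fft or n
    # Window the segment (assumption: Hann window) and take the amplitude spectrum.
    spectrum = np.abs(np.fft.rfft(segment * np.hanning(n), n_fft))
    # A bin is a peak when it exceeds its left neighbour and is not below its right one.
    k = np.arange(1, len(spectrum) - 1)
    is_peak = (spectrum[k] > spectrum[k - 1]) & (spectrum[k] >= spectrum[k + 1])
    peak_bins = k[is_peak]
    return peak_bins * fs / n_fft, spectrum[peak_bins]

# Illustrative test signal: two sinusoids at 1000 Hz and 2500 Hz.
fs = 8000.0
t = np.arange(512) / fs
x = np.sin(2 * np.pi * 1000.0 * t) + 0.5 * np.sin(2 * np.pi * 2500.0 * t)
freqs, amps = pick_peaks(x, fs)
# The two largest peaks correspond to the two sinusoids; every other local
# maximum is also returned, which is the drawback discussed in the text.
strongest = freqs[np.argsort(amps)[-2:]]
```

Because every local maximum is returned, the number of candidates is not known beforehand and grows when the spectrum is noisy.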

A problem arising from the peak-picking method is that it is not known
beforehand how many sinusoids will be estimated, since all peaks are selected. Especially when
the amplitude spectrum is noisy, too many sinusoids are selected. In contrast to peak-picking,
the number of selected sinusoids in matching pursuit is fixed. As a consequence, in order to
guarantee that all relevant sinusoids will be selected, this fixed number should be set high.
Again, too many sinusoids will be selected. The selection of too many sinusoids results in a
high bit rate, since all of these sinusoids have to be encoded. Another disadvantage is the
additional processing time. Perceptual modeling, for example, is a process used in
many audio encoders in order to encode only that part of an audio signal that can be heard by
a human ear. This modeling can be an expensive process and, as a result, a large number of
sinusoids to be analyzed is undesirable.






An object of the invention is to provide audio encoding that is advantageous in 
terms of bit-rate for a given audio quality. To this end, the invention provides a method of
encoding, an audio encoder and an audio system as defined in the independent claims. 
Advantageous embodiments are defined in the dependent claims. 

A first aspect of the invention provides a method of encoding an audio signal
by representing at least part of said audio signal by a plurality of sinusoids, the method
comprising the steps of performing an analysis on a first segment of the audio signal,
selecting candidate sinusoids based on said analysis, defining for at least one of the candidate
sinusoids a local frequency band around said candidate sinusoid's frequency, combining
amplitudes of frequency components within said local frequency band from which at least
one of the candidate sinusoids within said local frequency band is excluded, and selecting said
candidate sinusoid as a selected sinusoid in dependence on the combination of amplitudes.
Said analysis for selecting candidate sinusoids will usually be a frequency analysis. Such a
frequency analysis is for example used in conventional sinusoid selection techniques such as
peak-picking or matching pursuit. For the selection procedure applied on said candidate
sinusoids, an analysis is performed on a second segment of the audio signal. Usually, the
second segment will be equal to the first segment used in the selection of candidate sinusoids,
but this is not necessarily the case. By combining amplitudes of frequency components
within said local frequency band from which at least one of the candidate sinusoids within
said local frequency band is excluded, a measure is obtained for background frequency
components within said candidate sinusoid's local frequency band. By using this measure, a
better selection is made. Only selected sinusoids are encoded. As a result, the selection
procedure will result in a smaller number of sinusoids to be encoded for a given audio
quality, which is advantageous in terms of bit-rate.

According to a further aspect of the invention, a bandwidth of said local
frequency band around said candidate sinusoid's frequency is defined in dependence on said
candidate sinusoid's frequency. Because of said dependence on said candidate sinusoid's
frequency, the selection procedure can be tuned suitably for different frequencies.

According to a still further aspect of the invention, said dependence on said 
candidate sinusoid's frequency is based on a human's perception of audio. An example of 
such a dependence is defined by a Bark bandwidth. A Bark is a unit of perceptual frequency, 
which is known in the art. Other examples are the Mel scale and the ERB scale, which are 
also known in the art. By taking the human's perception of audio into account, a better 
decision is made to select a candidate sinusoid as a selected sinusoid.

In an embodiment of the invention, said candidate sinusoid is selected as a 
selected sinusoid when its amplitude is significant with regard to said combination of 
amplitudes, which significance is evaluated by thresholding a difference between said 
candidate sinusoid's amplitude and a weighted mean amplitude of frequency components 
within said candidate sinusoid's local frequency band from which at least one of the 
candidate sinusoids within said local frequency band is excluded. By thresholding said 
difference, a suitable method is obtained for determining the peakiness of a candidate 
sinusoid. 

In a further embodiment of the invention, said significance of said candidate 
sinusoid's amplitude is evaluated by thresholding a ratio of a difference between said 
candidate sinusoid's amplitude and a weighted mean amplitude of frequency components 
within said candidate sinusoid's local frequency band from which at least one of the 
candidate sinusoids within said local frequency band is excluded, and a weighted deviation of 
the amplitudes of frequency components within said local frequency band from which at least 
one of the candidate sinusoids within said local frequency band is excluded. For said 
deviation, a definition of the standard deviation can be used for example. By thresholding 
said ratio, another suitable method is obtained for determining the peakiness of a candidate
sinusoid. 

In a still further embodiment of the invention, a further selection procedure is
applied on the selected sinusoids. This further selection procedure comprises the steps of
determining for at least one of the selected sinusoids a phase consistency defined by an extent
to which a phase of said selected sinusoid at a certain moment in time can be predicted from
a phase of said selected sinusoid determined at another moment in time, and selecting said
selected sinusoid as a further selected sinusoid when its phase consistency is above a
predetermined threshold. The phase of said selected sinusoid at a certain moment in time can
be predicted from the phase of said selected sinusoid determined at another moment in time,
as its frequency and the time difference between the time of prediction and the time of
determination are known. The invention is based on the insight that when sinusoids are
synthesized in a decoder in order to reproduce an encoded audio signal, the sinusoids' phases
will be consistent. By selecting for encoding those sinusoids of which the phases are
consistent, a better selection is made. The further selection is based on a sinusoid's phase,
which is independent of its amplitude. Consequently, the further selection can lead to a
further reduction of the number of further selected sinusoids in comparison to the number of
selected sinusoids selected by the previous selection procedure. Only further selected
sinusoids will have to be encoded. As a result, the further selection procedure will result in a
smaller number of sinusoids to be encoded for a given audio quality, which is advantageous
in terms of bit-rate. Because of the independence between the
selection procedure based on amplitudes and the further selection procedure based on phase
consistency, it is also possible to perform both selection procedures in parallel. Both selection
procedures then make a selection out of the candidate sinusoids, after which the results can
be combined.

In an even further embodiment of the invention, said selected sinusoid's phase
consistency is determined by segmenting a third segment of the audio signal into at least a
first and a second part, determining the actual phases of said selected sinusoid in at least the
first and the second part, using the actual phase in the first part to serve as the input for
predicting the actual phase in the second part, and determining said selected sinusoid's phase
consistency based on a prediction error between the actual phase and the predicted phase in
the second part. Usually, the third segment will be equal to the second segment used in the
previous selection procedure, but this is not necessarily the case. An advantage of this
embodiment is that said selected sinusoid's actual phase can be easily determined by
performing a frequency analysis like an FFT procedure, for which analysis a part of the audio
signal is needed as an input.

The aforementioned and other aspects of the invention will be apparent from 
and elucidated with reference to the embodiments described hereinafter. 



In the drawings: 

Fig. 1 shows an embodiment of an audio encoder according to the invention; 
Fig. 2 shows a block diagram representing a selection procedure applied on 
candidate sinusoids according to the invention; 
Fig. 3 shows an example of segmenting an audio segment into smaller parts for
determining a selected sinusoid's phase consistency;

Fig. 4 shows an embodiment of an audio system according to the invention. 

The drawings only show those elements that are necessary to understand the invention. 



Fig. 1 shows an embodiment of an audio encoder 1 according to the invention,
comprising an input unit 10 for obtaining an input audio signal x(t). The audio encoder 1
separates the input signal into three components: transient signal components, sinusoidal
signal components and noise signal components. The audio encoder 1 comprises a transient
encoder 11, a sinusoidal encoder 12 and a noise analyzer 13.

The transient encoder 11 comprises a transient detector (TD) 110, a transient
analyzer (TA) 111 and a transient synthesizer (TS) 112. First, the signal x(t) enters the
transient detector 110, the transient analyzer 111 and a subtracter 15. The transient detector
110 estimates whether there is a transient signal component and at which position. This
information is fed to the transient analyzer 111. This information may also be used in a
sinusoidal analyzer (SA) 120 or the noise analyzer (NA) 13 to obtain advantageous
signal-induced segmentation. The transient analyzer 111 tries to extract (the main part of) the
transient signal component. This is for example done by matching a shape function to a signal
segment and determining the content underneath the shape function, e.g. a (small) number of
sinusoids. This information is contained in a transient code $C_T$. The transient code $C_T$ is
furnished to the transient synthesizer 112 and a multiplexer 14. The synthesized transient
signal component is subtracted from the input signal x(t) in subtracter 15, resulting in a
signal $x_1$, which is furnished to the sinusoidal analyzer 120 and a further subtracter 16. The
sinusoidal analyzer 120 determines the sinusoidal signal components. This information is
contained in a sinusoidal code $C_S$, which is furnished to a sinusoidal synthesizer (SS) 121 and
the multiplexer 14. From the sinusoidal code $C_S$, the sinusoidal signal components are
reconstructed by the sinusoidal synthesizer 121. This signal is subtracted in subtracter 16
from the signal $x_1$. The remaining signal $x_2$ is devoid of (large) transient signal
components and (main) sinusoidal signal components and is therefore assumed to mainly
consist of noise. Consequently, the signal $x_2$ is furnished to the noise analyzer 13, where it is
analyzed for its spectral and temporal envelope. This information is contained in a noise code
$C_N$. In the multiplexer 14, an audio stream AS is constituted which includes the codes $C_T$, $C_S$
and $C_N$. The audio stream AS is furnished to e.g. a data bus, an antenna system, a storage
medium etc.

In the following, the selection of sinusoids in the sinusoidal analyzer 120
according to an embodiment of the invention will be discussed. It is also possible to use the 
sinusoid selection procedure in the transient analyzer 111, though this is rarely done in 
practice as only a small number of sinusoids are analyzed there. 

Before the actual selection of sinusoids is performed, a number of candidate sinusoids
are first selected. An analysis is performed on a first segment of the audio signal, from which
analysis candidate sinusoids are selected. This selection can for example be performed by
conventional techniques like peak-picking or matching pursuit, which use a frequency
analysis on the first segment. The result will be a number of candidate sinusoids suitable for a
more specific sinusoid selection procedure. Fig. 2 shows a block diagram representing a
selection process applied on candidate sinusoids according to the invention. The frequencies
of these candidate sinusoids are stored in $F_q = (f_1, f_2, \ldots, f_R)$, with $R$ the number of
candidate sinusoids and the frequencies $f_i$ defined in Hertz (Hz). A second segment can be
windowed suitably for a frequency analysis, which results in a windowed segment $x_w$. The
second segment will usually be equal to the first segment used in the selection of candidate
sinusoids, but a different second segment can also be used. First, a preprocessing stage (PP)
is performed. In (I), for each frequency $f_i$ from $F_q$, the candidate sinusoids are synthesized
and subtracted from the windowed segment $x_w$. In (II), the resulting segment $x_{ws}$ is
zero-padded to length $P$ and analyzed for its frequency components by for example an FFT
procedure. The resulting amplitude spectrum is denoted by $|X_s|$. Secondly, in (III), the
segment $x_w$ is zero-padded to length $P$ and analyzed for its frequency components without
subtracting frequencies, resulting in amplitude spectrum $|X|$. After the preprocessing stage, a
selection procedure is started for at least one of the candidate sinusoids having a frequency
$f_i$ from $F_q$, initialized by (IV). In (V) a local frequency band is determined around said
frequency $f_i$. For defining the local frequency band, different definitions can be used. In this
case it is chosen to use a Bark bandwidth, e.g. defined by the critical bandwidth:

$$b(f_i) = 25 + 75 \cdot \left(1 + 1.4 \cdot 10^{-6} \cdot f_i^{2}\right)^{0.69} \qquad (1)$$
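
As a quick numerical check of equation (1), the critical bandwidth can be evaluated as in the sketch below. The function name `critical_bandwidth_hz` is hypothetical, and the exponent 0.69 follows the commonly used closed form of this critical-bandwidth approximation.

```python
def critical_bandwidth_hz(f_hz):
    """Critical bandwidth in Hz around frequency f_hz (in Hz), as in equation (1)."""
    return 25.0 + 75.0 * (1.0 + 1.4e-6 * f_hz ** 2) ** 0.69

# The band is about 100 Hz wide at low frequencies and widens with frequency,
# so the background measure is taken over a perceptually comparable range.
for f in (100.0, 1000.0, 4000.0):
    print(f, critical_bandwidth_hz(f))
```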

From the critical bandwidth $b(f_i)$ defined in Hertz (Hz) the boundary frequencies $f_a$ and $f_b$ are
determined by:

$$f_a = f_i - \frac{b(f_i)}{2}, \qquad f_b = f_i + \frac{b(f_i)}{2} \qquad (2)$$

The spectrum is indexed with index $i_{spec}$ running from 0 to $(P-1)$ in relation to the frequency
$f_{spec}$ according to:

$$f_{spec} = \frac{i_{spec}}{P} \cdot F_s \qquad (3)$$

in which $F_s$ is the sampling frequency (e.g. 44.1 kHz). Consequently, the indices $i_a$ and $i_b$ in
the spectrum corresponding to the boundary frequencies $f_a$ and $f_b$ are determined by:

$$i_a = \mathrm{round}\!\left(\frac{f_a}{F_s} \cdot P\right), \qquad i_b = \mathrm{round}\!\left(\frac{f_b}{F_s} \cdot P\right) \qquad (4)$$

in which round(r) denotes rounding of r to the closest integer. Now that the local
frequency band is defined, a mean amplitude $m_i$ of the frequency band of the candidate
sinusoid is computed in (VI) from $|X_s|$ by:

$$m_i = \frac{\sum_{k=i_a}^{i_b} W_1(k) \cdot A_s(k)}{\sum_{k=i_a}^{i_b} W_1(k)} \qquad (5)$$

in which $A_s(k)$ is the amplitude of the frequency component in the amplitude spectrum $|X_s|$ at
index $k$ and $W_1(k)$ is a weighting factor dependent on index $k$. The weighting factor can be
constant for all $k$. However, the weighting factor can for example also be decreasing for an
index $k$ closer to one of the boundary frequency indices $i_a$ or $i_b$, in order to reduce boundary
effects. The candidate sinusoid will be selected as a selected sinusoid in dependence on other
amplitudes within its local frequency band. Therefore, a method to select the candidate
sinusoid as a selected sinusoid is to use a criterion based on the weighted mean amplitude of
the candidate sinusoid's frequency band $m_i$ as calculated in (5) and the amplitude of the
candidate sinusoid $A_i = A(i_{f_i})$, of which the index $i_{f_i}$ in the amplitude spectrum can be
determined by:

$$i_{f_i} = \mathrm{round}\!\left(\frac{f_i}{F_s} \cdot P\right) \qquad (6)$$

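
The index mapping and the weighted mean of the local band can be sketched as follows. This is an illustration under stated assumptions: the band is taken symmetric around the candidate frequency, the indices are clipped to the one-sided spectrum, and the names `band_indices`, `candidate_index` and `weighted_mean_amplitude` are hypothetical.

```python
import numpy as np

def band_indices(f_i, bandwidth_hz, fs, p):
    """Spectrum indices (i_a, i_b) of a band assumed symmetric around f_i."""
    f_a = f_i - bandwidth_hz / 2.0
    f_b = f_i + bandwidth_hz / 2.0
    # Map frequencies to FFT bins of a length-p transform; clipping to
    # [0, p/2] is an added safeguard, not something stated in the text.
    i_a = max(int(round(f_a * p / fs)), 0)
    i_b = min(int(round(f_b * p / fs)), p // 2)
    return i_a, i_b

def candidate_index(f_i, fs, p):
    """Spectrum index of the candidate frequency itself."""
    return int(round(f_i * p / fs))

def weighted_mean_amplitude(a_s, i_a, i_b, weights=None):
    """Weighted mean of the residual amplitudes a_s[i_a..i_b] (inclusive)."""
    band = np.asarray(a_s[i_a:i_b + 1], dtype=float)
    w = np.ones_like(band) if weights is None else np.asarray(weights, dtype=float)
    return float(np.sum(w * band) / np.sum(w))

# Example: a 1 kHz candidate, a 162.2 Hz wide band, 44.1 kHz sampling, P = 4096.
i_a, i_b = band_indices(1000.0, 162.2, 44100.0, 4096)
i_f = candidate_index(1000.0, 44100.0, 4096)
```

Because the mean is taken over the residual spectrum, from which the candidates have been subtracted, it reflects the background around the candidate rather than the candidate itself.
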
In a further embodiment of the invention, the criterion used in the selection
procedure also comprises a standard deviation of the candidate sinusoid's local frequency
band, which is calculated in (VII) by:

$$\sigma_i = \sqrt{\frac{1}{\sum_{k=i_a}^{i_b} W_2(k)} \sum_{k=i_a}^{i_b} W_2(k) \cdot \left(A_s(k) - m_i\right)^{2}} \qquad (7)$$

in which $W_2(k)$ is a further weighting factor dependent on index $k$. The further weighting
factor can be constant for all $k$. However, the further weighting factor can for example also be
decreasing for an index $k$ closer to one of the boundary frequency indices $i_a$ or $i_b$, in order to
reduce boundary effects. $W_2(k)$ can be chosen equal to $W_1(k)$ used in (5), but this is not
necessarily the case. From the amplitude of the candidate sinusoid $A_i$, the mean amplitude $m_i$
and the standard deviation $\sigma_i$ of the candidate sinusoid's frequency band, a ratio $r_i$ can be
defined that is a measure for a peakiness of the candidate sinusoid:

$$r_i = \frac{A_i - m_i}{\sigma_i} \qquad (8)$$

In the selection criterion (VIII) this ratio $r_i$ is compared to a threshold $T_i$. The threshold can
for example be a fixed threshold or a threshold dependent on certain parameters like the
frequency of the candidate sinusoid $f_i$, the index $i_{f_i}$ of the frequency in the frequency
spectrum and/or the number of samples $P$ used for the frequency analysis. An example of a
definition for the threshold $T_i$ is:

$$T_i = \frac{2 \cdot i_{f_i}}{P/2} \cdot 5 + 1 \qquad (9)$$

If the ratio $r_i$ is above the threshold $T_i$, the candidate sinusoid of frequency $f_i$ is
kept for encoding (S). Otherwise the candidate sinusoid is rejected (NS).
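
The peakiness test can be illustrated with the sketch below, which assumes a fixed threshold (one of the options mentioned above) and constant weights by default. The function names and the handling of a zero deviation are assumptions made for this example only.

```python
import numpy as np

def peakiness_ratio(a_i, background, weights=None):
    """Ratio (a_i - mean) / deviation, using weighted statistics of the background."""
    bg = np.asarray(background, dtype=float)
    w = np.ones_like(bg) if weights is None else np.asarray(weights, dtype=float)
    mean = np.sum(w * bg) / np.sum(w)
    sigma = np.sqrt(np.sum(w * (bg - mean) ** 2) / np.sum(w))
    if sigma == 0.0:
        # Assumption: a perfectly flat background makes any excess significant.
        return float("inf") if a_i > mean else 0.0
    return float((a_i - mean) / sigma)

def keep_candidate(a_i, background, threshold):
    """True when the candidate is kept for encoding (S), False when rejected (NS)."""
    return peakiness_ratio(a_i, background, weights=None) > threshold

# Toy background with mean 2 and deviation 1.
background = [1.0, 1.0, 3.0, 3.0]
strong = keep_candidate(10.0, background, threshold=6.0)  # ratio is 8
weak = keep_candidate(4.0, background, threshold=6.0)     # ratio is 2
```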

In a still further embodiment of the invention, a further selection of the
selected sinusoids is performed. Therefore, the frequencies of the selected sinusoids based on
the previous selection procedure are stored in $F = (f_1, f_2, \ldots, f_L)$, with $L$ the number of
selected sinusoids and the frequencies $f_i$ defined in Hertz (Hz). On at least one of the selected
sinusoids, the further selection procedure will be applied, which is based on the phase
consistency of the selected sinusoid. The selected sinusoid's phase consistency is defined by
an extent to which a phase of said selected sinusoid at a certain moment in time can be
predicted from a phase of said selected sinusoid determined at another moment in time. Next,
said selected sinusoid is further selected as a further selected sinusoid when said phase
consistency is above a predetermined threshold.

In an even further embodiment of the invention, the selected sinusoid's phase
consistency is determined by first segmenting a third segment of the audio signal into smaller
parts. This third segment will usually be equal to the second segment used in the previous
selection procedure, but a different third segment can also be used. Two or more smaller
parts have to be available for determining the selected sinusoid's phase consistency. The
smaller parts can overlap each other, but this is not necessarily the case. A third
segment $x_s$ can for example be segmented into three overlapping smaller parts as shown in
Fig. 3. If $N$ is the number of samples of the third segment and $N$ is an even number, the
smaller parts are defined by:

$$\begin{aligned} x_{s_1}[k] &= x_s[k] \\ x_{s_2}[k] &= x_s[k + M/2] \\ x_{s_3}[k] &= x_s[k + M] \end{aligned} \qquad (10)$$

in which $M = N/2$ and $1 \le k \le M$. The smaller parts $x_{s_1}$, $x_{s_2}$ and $x_{s_3}$ each have a length $M$. On
each of these three smaller parts, the actual phases of the selected sinusoid having a
frequency $f_i$ from $F$ are determined. For this purpose the smaller parts can be windowed
suitably for a frequency analysis, after which a frequency analysis such as an FFT procedure
can be performed. An example of the positions for a phase determination is shown in Fig. 3 by
$\varphi_1$, $\varphi_2$ and $\varphi_3$. Next, the phases can be predicted, in this case from smaller part 1 to 2, from 2
to 3 and from 1 to 3. The differences between the actual and the predicted phases lead to the
following prediction errors for the selected sinusoid:

$$\begin{aligned} E_{1,2} &= \left(\varphi_1 - \left(\varphi_2 - T/2 \cdot 2\pi \cdot f_i\right)\right) \bmod 2\pi \\ E_{2,3} &= \left(\varphi_3 - \left(\varphi_2 + T/2 \cdot 2\pi \cdot f_i\right)\right) \bmod 2\pi \\ E_{1,3} &= \left(\varphi_3 - \left(\varphi_1 + T \cdot 2\pi \cdot f_i\right)\right) \bmod 2\pi \end{aligned} \qquad (11)$$

in which the prediction errors are in modulo sense (mod $2\pi$), the phases $\varphi_1$, $\varphi_2$ and $\varphi_3$ are
given in radians, and $T$ is given in seconds and is defined by $T = M/F_s$. Using a certain criterion
based on these prediction errors $E$, the selected sinusoid can be further selected as a further
selected sinusoid. A possible criterion might be a test if at least one of the following
conditions is true:

$$\ldots, \qquad |E_{1,3}| < 2 \cdot c \qquad (12)$$

in which $c$ is typically dependent on the number of samples $N$ of the third segment $x_s$ and the
number of samples $M$ of the smaller parts $x_{s_1}$, $x_{s_2}$ and $x_{s_3}$. An example of a definition for $c$ is:

$$c = \frac{2 \cdot \pi \cdot M}{3 \cdot N^{2}} \qquad (13)$$

PHNL021345EPP, 18.12.2002

Fig. 4 shows an embodiment of an audio system according to the invention
comprising an audio encoder 1 as shown in Fig. 1. Such a system offers recording and/or
transmitting features. An audio signal x(t) is obtained by an audio signal obtaining device 41
such as an audio player, a microphone or an audio input connector etc. The audio signal x(t)
serves as the input for an audio encoder 1 as shown in Fig. 1. The output audio stream AS is
furnished from the audio encoder 1 to a formatting unit 42, which formats the audio stream
AS suitably for a communication channel 43, which may be a wireless connection, a data bus
or a storage medium. In case the communication channel 43 is a storage medium, the storage
medium may be fixed in the system or may also be a removable disc, a memory stick etc. The
communication channel 43 may be part of the audio system, but will often be
outside the audio system.

It should be noted that the above-mentioned embodiments illustrate rather than
limit the invention, and that those skilled in the art will be able to design many alternative
embodiments without departing from the scope of the appended claims. In the claims, any
reference signs between parentheses shall not be construed as limiting the claim. The word
'comprising' does not exclude the presence of other elements or steps than those listed in
a claim. The invention can be implemented by means of hardware comprising several distinct
elements, and by means of a suitably programmed computer. In a device claim enumerating
several means, several of these means can be embodied by one and the same item of
hardware. The mere fact that certain measures are recited in mutually different dependent
claims does not indicate that a combination of these measures cannot be used to advantage.

In summary, the invention provides a method of encoding an audio signal by
representing at least part of said audio signal by a plurality of sinusoids, the method
comprising the steps of performing an analysis on a first segment of said audio signal,
selecting candidate sinusoids based on said analysis, defining for at least one of the candidate
sinusoids a local frequency band around said candidate sinusoid's frequency, combining
amplitudes of frequency components within said local frequency band from which at least
one of the candidate sinusoids within said local frequency band is excluded, and selecting
said candidate sinusoid as a selected sinusoid in dependence on the combination of
amplitudes. The selection of sinusoids according to the invention will result in a smaller
number of sinusoids to be encoded for a given audio quality, which is advantageous in terms
of bit-rate.
CLAIMS:

1. A method of encoding an audio signal by representing at least part of said
audio signal by a plurality of sinusoids, the method comprising the steps of:
- performing an analysis on a first segment of said audio signal;
- selecting candidate sinusoids based on said analysis;
- defining for at least one of the candidate sinusoids a local frequency band
around said candidate sinusoid's frequency;
- combining amplitudes of frequency components within said local frequency
band from which at least one of the candidate sinusoids within said local frequency band is
excluded; and
- selecting said candidate sinusoid as a selected sinusoid in dependence on the
combination of amplitudes.



2. A method as claimed in claim 1, wherein a bandwidth of said local frequency
band around said candidate sinusoid's frequency is defined in dependence on said candidate
sinusoid's frequency.

3. A method as claimed in claim 2, wherein said dependence on said candidate 
sinusoid's frequency is based on a human's perception of audio. 

4. A method as claimed in claim 1, wherein said candidate sinusoid is selected as
a selected sinusoid when its amplitude is significant with regard to said combination of
amplitudes, which significance is evaluated by thresholding a difference between said
candidate sinusoid's amplitude and a weighted mean amplitude of frequency components
within said candidate sinusoid's local frequency band from which at least one of the
candidate sinusoids within said local frequency band is excluded.

5. A method as claimed in claim 1, wherein said candidate sinusoid is selected as
a selected sinusoid when its amplitude is significant with regard to said combination of
amplitudes, which significance is evaluated by thresholding a ratio of:
- a difference between said candidate sinusoid's amplitude and a weighted mean
amplitude of frequency components within said candidate sinusoid's local frequency band
from which at least one of the candidate sinusoids within said local frequency band is
excluded; and
- a weighted deviation of the amplitudes of frequency components within said
local frequency band from which at least one of the candidate sinusoids within said local
frequency band is excluded.

6. A method as claimed in claim 1, wherein the method further comprises a
further selection out of the selected sinusoids which comprises the steps of:
- determining for at least one of the selected sinusoids a phase consistency
defined by an extent to which a phase of said selected sinusoid at a certain moment in time
can be predicted from a phase of said selected sinusoid determined at another moment in
time; and
- further selecting said selected sinusoid as a further selected sinusoid when its
phase consistency is above a predetermined threshold.

7. A method as claimed in claim 6, wherein the determination of said selected
sinusoid's phase consistency comprises the steps of:
- segmenting a third segment of said audio signal into at least a first and a
second part;
- determining the actual phases of said selected sinusoid in at least the first and
the second part;
- using the actual phase in the first part to serve as the input for predicting the
actual phase in the second part; and
- determining said selected sinusoid's phase consistency based on a prediction
error between the actual phase and the predicted phase in the second part.

8. An audio encoder for encoding an audio signal by representing at least part of
said audio signal by a plurality of sinusoids, the audio encoder comprising:
- means for performing an analysis on a first segment of said audio signal;
- means for selecting candidate sinusoids based on said analysis;
- means for defining for at least one of the candidate sinusoids a local frequency
band around said candidate sinusoid's frequency;
- means for combining amplitudes of frequency components within said local
frequency band from which at least one of the candidate sinusoids within said local frequency
band is excluded; and
- means for selecting said candidate sinusoid as a selected sinusoid in
dependence on the combination of amplitudes.

9. An audio encoder as claimed in claim 8, wherein the audio encoder is further
conceived to perform a further selection out of the selected sinusoids, for which further
selection the audio encoder further comprises:
- means for determining for at least one of the selected sinusoids a phase
consistency defined by an extent to which a phase of said selected sinusoid at a certain
moment in time can be predicted from a phase of said selected sinusoid determined at another
moment in time; and
- means for further selecting said selected sinusoid as a further selected sinusoid
when its phase consistency is above a predetermined threshold.

10. Audio system comprising means for obtaining an audio signal, an audio
encoder as claimed in claim 8 or 9 for encoding said audio signal to obtain an encoded audio
signal, and a formatting unit for formatting the encoded audio signal into a format suitable for
storage and/or transmission.
A method of encoding (1) an audio signal (x(t)) by representing (12) at least
part of said audio signal by a plurality of sinusoids, the method comprising the steps of
performing an analysis on a first segment of said audio signal, selecting candidate sinusoids
based on said analysis, defining for at least one of the candidate sinusoids a local frequency
band around said candidate sinusoid's frequency, combining amplitudes of frequency
components within said local frequency band from which at least one of the candidate
sinusoids within said local frequency band is excluded, and selecting said candidate sinusoid
as a selected sinusoid in dependence on the combination of amplitudes. The selection of
sinusoids according to the invention will result in a smaller number of sinusoids to be
encoded for a given audio quality, which is advantageous in terms of bit-rate for a given
audio quality.